Taxonomy-based Adaptive Web Search Method
Abstract
Current crawler-based search engines usually return a long list of search results that contains many noise documents. By indexing collected documents by topic path in a taxonomy, taxonomy-based search engines can improve the quality of search results. However, their searches are limited to locally compiled databases. In this paper, we propose an adaptive web search method that improves search result quality while enabling users to search the many databases that exist in the web space. The method combines taxonomy-based search engines with a machine learning technique. More specifically, we construct a rule-based classifier from pre-classified documents that a taxonomy-based search engine provides for a context category selected in its taxonomy, and then use the classifier to modify the user query. The modified query is sent to crawler-based search engines, and the returned results are presented to the user. We evaluate the effectiveness of our method by showing that the results returned for the modified query consist almost entirely of documents that would be categorized into the selected context category.
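The pipeline described in the abstract (learn rules from pre-classified documents, then use them to modify the query) can be illustrated with a minimal sketch. The abstract does not specify the rule learner or the query syntax, so everything below is an assumption made for illustration: the greedy sequential-covering learner, the whitespace tokenizer, the function names `learn_rules` and `modify_query`, the `AND`/`OR` operators, and the toy documents.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return set(text.lower().split())


def learn_rules(positive_docs, negative_docs, max_rules=3, max_terms=2):
    """Greedy sequential covering (an assumed stand-in for the paper's learner).

    Each rule is a conjunction of terms that matches some documents of the
    selected context category and none of the documents outside it.
    """
    pos = [tokenize(d) for d in positive_docs]
    neg = [tokenize(d) for d in negative_docs]
    rules = []
    uncovered = list(pos)
    while uncovered and len(rules) < max_rules:
        rule = []
        cand_pos, cand_neg = uncovered, neg
        # Add terms until the rule no longer matches any negative document.
        while cand_neg and len(rule) < max_terms:
            counts_pos = Counter(t for d in cand_pos for t in d if t not in rule)
            if not counts_pos:
                break
            counts_neg = Counter(t for d in cand_neg for t in d)
            # Best term: covers many positives and few negatives.
            # Iterating in sorted order makes ties resolve alphabetically.
            best = max(sorted(counts_pos),
                       key=lambda t: counts_pos[t] - counts_neg[t])
            rule.append(best)
            cand_pos = [d for d in cand_pos if best in d]
            cand_neg = [d for d in cand_neg if best in d]
        if not rule or cand_neg:
            break  # no rule found that excludes every negative document
        rules.append(rule)
        uncovered = [d for d in uncovered if not all(t in d for t in rule)]
    return rules


def modify_query(user_query, rules):
    """Append the learned rules to the user query as a boolean expression."""
    if not rules:
        return user_query
    clauses = ["(" + " AND ".join(rule) + ")" for rule in rules]
    return f"{user_query} AND ({' OR '.join(clauses)})"


# Hypothetical pre-classified documents for the query "jaguar" with the
# context category set to the animal sense of the word.
positive = [
    "jaguar cat habitat rainforest",
    "jaguar predator cat prey",
    "jaguar rainforest habitat wildlife",
]
negative = [
    "jaguar car dealer price",
    "jaguar car engine review",
    "jaguar os apple software",
]

rules = learn_rules(positive, negative)
modified = modify_query("jaguar", rules)
print(modified)
```

In this sketch the modified query string is what would be forwarded to a crawler-based search engine. Whether a given engine accepts this boolean syntax is also an assumption, and a real system would need to translate the rules into that engine's own query language.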
Related resources
A Taxonomy-based Focused Retrieval Method for the Web Space
The problem of word ambiguity is fundamental to information retrieval in the web space. This problem originates from the use of very short queries, which is common in web information retrieval [1]. One way to deal with this issue is to provide a taxonomy to the user so that the user can use it to express his/her query intent to the system. This approach is taken by existing taxonomy (directory)-...
SAGE Agent for the SATELIT Web-based system
This article presents SAGE, an adaptive interface agent for the Web-based SATELIT system. This system is dedicated to developing and browsing Web-based catalogues in the fields of natural sciences that work with taxonomies. The SAGE learning agent was developed as a part of the SATELIT adaptive interface and deals with the general hypermedia problem of "getting lost in hypermedia space". SAGE...
Automatic discovery of synonyms and lexicalizations from the Web
Searching for Web resources is a very important topic because of the huge amount of valuable information available on the WWW. Standard search engines can be a great help, but they are often based only on the presence or absence of keywords. Thus, problems of semantic ambiguity appear. In order to solve one of them, we propose a new method for discovering lexicalizations and synonyms of search...
TaxoGen: Constructing Topical Concept Taxonomy by Adaptive Term Embedding and Clustering
Taxonomy construction is not only a fundamental task for semantic analysis of text corpora, but also an important step for applications such as information filtering, recommendation, and Web search. Existing pattern-based methods extract hypernym-hyponym term pairs and then organize these pairs into a taxonomy. However, by considering each term as an independent concept node, they overlook the ...
A New Hybrid Method for Web Pages Ranking in Search Engines
There are many algorithms for optimizing search engine results; ranking takes place according to one or more parameters such as backward links, forward links, content, and click-through rate. The quality and performance of these algorithms depend on the listed parameters. Ranking is one of the most important components of a search engine that represents the degree of the vitality...
Publication date: 2002